Identifying All Distinct Sample P-P Plots, with an Application to the Exact Finite Sample Distribution of the L1-FCvM Test Statistic

نویسندگان

  • Jeroen Hinloopen
  • Rien Wagenvoort
چکیده

P-p plots contain all the information that is needed for scaleinvariant comparisons. Indeed, Empirical Distribution Function (EDF) tests translate sample p-p plots into a single number. In this paper we characterize the set of all distinct p-p plots for two balanced sample of size  absent ties. Distributions of EDF test statistics are embedded in this set. It is thus used to derive the exact finite sample distribution of the L1-version of the Fisz-Cramér-von Mises test. Comparing this distribution with the (known) limiting distribution shows that the latter can always be used for hypothesis testing: although for finite samples the critical percentiles of the limiting distribution differ from the exact values, this will not lead to differences in the rejection of the underlying hypothesis.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the Exact Finite Sample Distribution of the L 1-FCvM Test Statistic

We derive the exact finite sample distribution of the L1-version of the Fisz-Cramér-von Mises test statistic (L1-FCvM). We first characterize the set of all distinct sample p-p plots for two balanced sample of size  absent ties. Next, we order this set according to the corresponding value of L1-FCvM. Finally, we link these values to the probabilities that the underlying p-p plots emerge. Compa...

متن کامل

The -Version of the Cramér-von Mises Test for Two-Sample Comparisons in Microarray Data Analysis

Distribution-free statistical tests offer clear advantages in situations where the exact unadjusted p-values are required as input for multiple testing procedures. Such situations prevail when testing for differential expression of genes in microarray studies. The Cramér-von Mises two-sample test, based on a certain L-distance between two empirical distribution functions, is a distribution-free...

متن کامل

Determining the sample size required to compare vegetation and soil characteristics in two independent groups using effect size

Extended Abstract Background and objectives: One of the important steps in assessing rangeland vegetation is determining the sample size. Adequacy of sample size and its determination is always one of the main concerns of rangeland vegetation analyzer. There are two general methods for determining the sample size in rangeland science: graphic and statistical methods. In this study, the sample...

متن کامل

MAFsnp: A Multi-Sample Accurate and Flexible SNP Caller Using Next-Generation Sequencing Data

Most existing statistical methods developed for calling single nucleotide polymorphisms (SNPs) using next-generation sequencing (NGS) data are based on Bayesian frameworks, and there does not exist any SNP caller that produces p-values for calling SNPs in a frequentist framework. To fill in this gap, we develop a new method MAFsnp, a Multiple-sample based Accurate and Flexible algorithm for cal...

متن کامل

High Dimensional Correlation Matrices: CLT and Its Applications

Statistical inferences for sample correlation matrices are important in high dimensional data analysis. Motivated by this, this paper establishes a new central limit theorem (CLT) for a linear spectral statistic (LSS) of high dimensional sample correlation matrices for the case where the dimension p and the sample size n are comparable. This result is of independent interest in large dimensiona...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010